Generating Aspect-oriented Multi-Document Summarization with Event-aspect model
نویسندگان
چکیده
In this paper, we propose a novel approach to automatic generation of aspect-oriented summaries from multiple documents. We first develop an event-aspect LDA model to cluster sentences into aspects. We then use extended LexRank algorithm to rank the sentences in each cluster. We use Integer Linear Programming for sentence selection. Key features of our method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model. Also, we implement a new sentence compression algorithm which use dependency tree instead of parser tree. We compare our method with four baseline methods. Quantitative evaluation based on Rouge metric demonstrates the effectiveness and advantages of our method.
منابع مشابه
Exploiting aspectual features and connecting words for summarization-inspired temporal-relation extraction
This paper presents a model that incorporates contemporary theories of tense and aspect and develops a new framework for extracting temporal relations between two sentence-internal events, given their tense, aspect, and a temporal connecting word relating the two events. A linguistic constraint on event combination has been implemented to detect incorrect parser analyses and potentially apply s...
متن کاملMulti-Document Summarization with Subjectivity Analysis
In this paper, we present our team TUT/NII results at DUC 2005 and additional experiments on improving multi-document summarization. Summarization systems have typically focused on the factual aspect of information needs. Subjectivity analysis is another essential aspect for better understanding of information needs. Our approach is based on sentence extraction, weighted by sentence type annota...
متن کاملGuided Summarization with Aspect Recognition
As a continuation of the summarization track of TAC 2010, the TAC 2011 summarization track pursues aspect-guided summarization. It is intended “to encourage a deeper linguistic (semantic) analysis of the source documents instead of relying only on document word frequencies to select important concepts”. The sustained interest in guided summarization marks a significant turn to semantically orie...
متن کاملComments-Oriented Document Summarization Based on Multi-aspect Co-feedback Ranking
With the popularity of Web 2.0, comments left by readers on web documents have drawn much attention. In this paper, we study the problem of comments-oriented document summarization, which aims to summarize a web document by considering not only its content but also the comments. Generally, most of the comments usually convey one or a few aspects of the document. Given a sentence set from both t...
متن کاملFine Granular Aspect Analysis using Latent Structural Models
In this paper, we present a structural learning model for joint sentiment classification and aspect analysis of text at various levels of granularity. Our model aims to identify highly informative sentences that are aspect-specific in online custom reviews. The primary advantages of our model are two-fold: first, it performs document-level and sentence-level sentiment polarity classification jo...
متن کامل